Mohammad Soleymani

Do Audio LLMs Listen or Read? Analyzing and Mitigating Paralinguistic Failures with VoxParadox

May 26, 2026

The manifold of unitary and symmetric matrices: characterization, Riemannian optimization and application to BD-RIS design

Apr 24, 2026

Optimal symmetric low-rank BD-RIS configuration maximizing the determinant of a MIMO link

Apr 10, 2026

GDPO-Listener: Expressive Interactive Head Generation via Auto-Regressive Flow Matching and Group reward-Decoupled Policy Optimization

Mar 26, 2026

RIS-Aided RSMA Improves the Latency vs. Energy Trade-off in the Finite Block Length MIMO Downlink

Mar 16, 2026

MoD-DPO: Towards Mitigating Cross-modal Hallucinations in Omni LLMs using Modality Decoupled Preference Optimization

Mar 03, 2026

HairWeaver: Few-Shot Photorealistic Hair Motion Synthesis with Sim-to-Real Guided Video Diffusion

Feb 11, 2026

AVERE: Improving Audiovisual Emotion Reasoning with Preference Optimization

Feb 04, 2026

Riemannian optimization on the manifold of unitary and symmetric matrices with application to BD-RIS-assisted systems

Jan 20, 2026

Discrete Facial Encoding: A Framework for Data-driven Facial Display Discovery

Oct 02, 2025